Dataset statistics
| Number of variables | 15 |
|---|---|
| Number of observations | 4703 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 1 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 523.6 KiB |
| Average record size in memory | 114.0 B |
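The average record size and the duplicate share follow directly from the figures in the table above. A quick arithmetic check, assuming 1 KiB = 1024 B:

```python
# Figures copied from the dataset statistics table.
total_kib = 523.6
n_rows = 4703
duplicate_rows = 1

# Convert KiB to bytes, then divide by the number of observations.
avg_record_bytes = total_kib * 1024 / n_rows
# Share of duplicate rows as a percentage of all rows.
duplicate_pct = 100 * duplicate_rows / n_rows

print(f"{avg_record_bytes:.1f} B per record")
print(f"{duplicate_pct:.3f}% duplicate rows")
```

The duplicate share is well under 0.1%, which is why the table shows it as "< 0.1%".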
Variable types
| Numeric | 12 |
|---|---|
| Boolean | 2 |
| Categorical | 1 |
| Dataset has 1 (< 0.1%) duplicate rows | Duplicates |
actor_1_fb_likes is highly overall correlated with other_actors_fb_likes | High correlation |
budget is highly overall correlated with gross and 1 other fields | High correlation |
country_UK is highly overall correlated with country_USA | High correlation |
country_USA is highly overall correlated with country_UK | High correlation |
critic_reviews_ratio is highly overall correlated with title_year | High correlation |
gross is highly overall correlated with budget and 1 other fields | High correlation |
num_voted_users is highly overall correlated with budget and 1 other fields | High correlation |
other_actors_fb_likes is highly overall correlated with actor_1_fb_likes | High correlation |
title_year is highly overall correlated with critic_reviews_ratio | High correlation |
country_UK is highly imbalanced (56.6%) | Imbalance |
budget is highly skewed (γ₁ = 49.02395721) | Skewed |
director_fb_likes has 825 (17.5%) zeros | Zeros |
facenumber_in_poster has 2019 (42.9%) zeros | Zeros |
movie_fb_likes has 2086 (44.4%) zeros | Zeros |
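The imbalance figure for country_UK can be reproduced from its class counts (4283 False, 420 True). The sketch below assumes the score is one minus the Shannon entropy normalized by the maximum entropy for the number of classes. That definition matches the reported 56.6%, but whether the profiler uses exactly this formula is an assumption here, and `imbalance_score` is a hypothetical helper name.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    # Entropy in bits; empty classes contribute nothing.
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)

# country_UK counts from this report: 4283 False, 420 True.
score = imbalance_score([4283, 420])
print(f"{score:.1%}")
```

A perfectly balanced variable scores 0 under this definition, and a variable with a single dominant class approaches 1.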
Reproduction
| Analysis started | 2024-04-11 22:27:01.952306 |
|---|---|
| Analysis finished | 2024-04-11 22:27:29.366066 |
| Duration | 27.41 seconds |
| Software version | ydata-profiling v4.7.0 |
| Download configuration | config.json |
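The reported duration is the difference between the two timestamps above. A minimal check with the standard library, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-04-11 22:27:01.952306", FMT)
finished = datetime.strptime("2024-04-11 22:27:29.366066", FMT)

# Elapsed wall-clock time in seconds.
elapsed = (finished - started).total_seconds()
print(f"{elapsed:.2f} seconds")
```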
duration
Real number (ℝ)
| Distinct | 164 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 108.63066 |
| Minimum | 14 |
|---|---|
| Maximum | 330 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 73.5 KiB |
Quantile statistics
| Minimum | 14 |
|---|---|
| 5-th percentile | 84 |
| Q1 | 94 |
| median | 104 |
| Q3 | 118 |
| 95-th percentile | 146 |
| Maximum | 330 |
| Range | 316 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 22.562204 |
|---|---|
| Coefficient of variation (CV) | 0.20769646 |
| Kurtosis | 11.779179 |
| Mean | 108.63066 |
| Median Absolute Deviation (MAD) | 11 |
| Skewness | 2.2280838 |
| Sum | 510890 |
| Variance | 509.05305 |
| Monotonicity | Not monotonic |
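Several of the duration statistics are derived from others in the same tables. The sketch below recomputes the range, the interquartile range, the coefficient of variation, and the variance from the reported values. The variable names are illustrative only.

```python
# Values copied from the duration tables.
mean, std = 108.63066, 22.562204
minimum, q1, q3, maximum = 14, 94, 118, 330

value_range = maximum - minimum   # spread between the extremes
iqr = q3 - q1                     # spread of the middle 50% of films
cv = std / mean                   # standard deviation relative to the mean
variance = std ** 2               # variance is the squared standard deviation

print(value_range, iqr, round(cv, 4), round(variance, 2))
```

A coefficient of variation near 0.2 indicates that durations are tightly clustered around the mean compared with the heavy-tailed like counts and monetary fields elsewhere in the report.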
| Value | Count | Frequency (%) |
| 90 | 143 | 3.0% |
| 100 | 134 | 2.8% |
| 98 | 130 | 2.8% |
| 101 | 130 | 2.8% |
| 97 | 125 | 2.7% |
| 93 | 120 | 2.6% |
| 99 | 120 | 2.6% |
| 94 | 120 | 2.6% |
| 95 | 119 | 2.5% |
| 106 | 108 | 2.3% |
| Other values (154) | 3454 | 73.4% |
| Value | Count | Frequency (%) |
| 14 | 1 | < 0.1% |
| 20 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 41 | 1 | < 0.1% |
| 45 | 2 | < 0.1% |
| 46 | 1 | < 0.1% |
| 47 | 1 | < 0.1% |
| 53 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 330 | 1 | < 0.1% |
| 325 | 1 | < 0.1% |
| 300 | 1 | < 0.1% |
| 293 | 1 | < 0.1% |
| 289 | 1 | < 0.1% |
| 280 | 1 | < 0.1% |
| 271 | 1 | < 0.1% |
| 270 | 1 | < 0.1% |
| 251 | 2 | < 0.1% |
| 240 | 2 | < 0.1% |
director_fb_likes
Real number (ℝ)
ZEROS 
| Distinct | 429 |
|---|---|
| Distinct (%) | 9.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 710.17223 |
| Minimum | 0 |
|---|---|
| Maximum | 23000 |
| Zeros | 825 |
| Zeros (%) | 17.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 73.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 8 |
| median | 52 |
| Q3 | 209 |
| 95-th percentile | 1000 |
| Maximum | 23000 |
| Range | 23000 |
| Interquartile range (IQR) | 201 |
Descriptive statistics
| Standard deviation | 2861.8195 |
|---|---|
| Coefficient of variation (CV) | 4.0297542 |
| Kurtosis | 26.029513 |
| Mean | 710.17223 |
| Median Absolute Deviation (MAD) | 52 |
| Skewness | 5.1181934 |
| Sum | 3339940 |
| Variance | 8190010.9 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 825 | 17.5% |
| 3 | 65 | 1.4% |
| 6 | 61 | 1.3% |
| 7 | 58 | 1.2% |
| 11 | 56 | 1.2% |
| 2 | 56 | 1.2% |
| 4 | 54 | 1.1% |
| 10 | 51 | 1.1% |
| 12 | 48 | 1.0% |
| 5 | 48 | 1.0% |
| Other values (419) | 3381 | 71.9% |
| Value | Count | Frequency (%) |
| 0 | 825 | 17.5% |
| 2 | 56 | 1.2% |
| 3 | 65 | 1.4% |
| 4 | 54 | 1.1% |
| 5 | 48 | 1.0% |
| 6 | 61 | 1.3% |
| 7 | 58 | 1.2% |
| 8 | 47 | 1.0% |
| 9 | 46 | 1.0% |
| 10 | 51 | 1.1% |
| Value | Count | Frequency (%) |
| 23000 | 1 | < 0.1% |
| 22000 | 8 | 0.2% |
| 21000 | 10 | 0.2% |
| 18000 | 4 | 0.1% |
| 17000 | 20 | 0.4% |
| 16000 | 28 | 0.6% |
| 15000 | 2 | < 0.1% |
| 14000 | 30 | 0.6% |
| 13000 | 26 | 0.6% |
| 12000 | 17 | 0.4% |
actor_1_fb_likes
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 843 |
|---|---|
| Distinct (%) | 17.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 6817.3957 |
| Minimum | 0 |
|---|---|
| Maximum | 640000 |
| Zeros | 14 |
| Zeros (%) | 0.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 73.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 116.1 |
| Q1 | 637 |
| median | 1000 |
| Q3 | 11000 |
| 95-th percentile | 24000 |
| Maximum | 640000 |
| Range | 640000 |
| Interquartile range (IQR) | 10363 |
Descriptive statistics
| Standard deviation | 14982.445 |
|---|---|
| Coefficient of variation (CV) | 2.1976786 |
| Kurtosis | 720.98565 |
| Mean | 6817.3957 |
| Median Absolute Deviation (MAD) | 790 |
| Skewness | 19.549467 |
| Sum | 32062212 |
| Variance | 2.2447365 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1000 | 417 | 8.9% |
| 11000 | 207 | 4.4% |
| 2000 | 187 | 4.0% |
| 3000 | 148 | 3.1% |
| 12000 | 133 | 2.8% |
| 13000 | 126 | 2.7% |
| 14000 | 121 | 2.6% |
| 10000 | 109 | 2.3% |
| 18000 | 108 | 2.3% |
| 22000 | 79 | 1.7% |
| Other values (833) | 3068 | 65.2% |
| Value | Count | Frequency (%) |
| 0 | 14 | 0.3% |
| 2 | 6 | 0.1% |
| 3 | 2 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 4 | 0.1% |
| 6 | 3 | 0.1% |
| 7 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| 11 | 2 | < 0.1% |
| 12 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 640000 | 1 | < 0.1% |
| 260000 | 2 | < 0.1% |
| 164000 | 2 | < 0.1% |
| 137000 | 2 | < 0.1% |
| 87000 | 8 | 0.2% |
| 77000 | 1 | < 0.1% |
| 49000 | 27 | 0.6% |
| 46000 | 1 | < 0.1% |
| 45000 | 5 | 0.1% |
| 44000 | 2 | < 0.1% |
gross
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 4146 |
|---|---|
| Distinct (%) | 88.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45085643 |
| Minimum | 162 |
|---|---|
| Maximum | 7.6050585 × 10⁸ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 73.5 KiB |
Quantile statistics
| Minimum | 162 |
|---|---|
| 5-th percentile | 100669.6 |
| Q1 | 6494675 |
| median | 24848292 |
| Q3 | 54548936 |
| 95-th percentile | 1.7099911 × 10⁸ |
| Maximum | 7.6050585 × 10⁸ |
| Range | 7.6050568 × 10⁸ |
| Interquartile range (IQR) | 48054262 |
Descriptive statistics
| Standard deviation | 64148103 |
|---|---|
| Coefficient of variation (CV) | 1.4228055 |
| Kurtosis | 16.705866 |
| Mean | 45085643 |
| Median Absolute Deviation (MAD) | 20807704 |
| Skewness | 3.3289438 |
| Sum | 2.1203778 × 10¹¹ |
| Variance | 4.1149791 × 10¹⁵ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 24848292 | 458 | 9.7% |
| 5000000 | 4 | 0.1% |
| 3000000 | 3 | 0.1% |
| 218051260 | 3 | 0.1% |
| 177343675 | 3 | 0.1% |
| 8000000 | 3 | 0.1% |
| 13401683 | 2 | < 0.1% |
| 800000 | 2 | < 0.1% |
| 22494487 | 2 | < 0.1% |
| 21028755 | 2 | < 0.1% |
| Other values (4136) | 4221 | 89.8% |
| Value | Count | Frequency (%) |
| 162 | 1 | < 0.1% |
| 423 | 1 | < 0.1% |
| 607 | 1 | < 0.1% |
| 703 | 1 | < 0.1% |
| 721 | 1 | < 0.1% |
| 728 | 1 | < 0.1% |
| 828 | 1 | < 0.1% |
| 1029 | 1 | < 0.1% |
| 1100 | 1 | < 0.1% |
| 1111 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 760505847 | 1 | < 0.1% |
| 658672302 | 1 | < 0.1% |
| 652177271 | 1 | < 0.1% |
| 623279547 | 1 | < 0.1% |
| 533316061 | 1 | < 0.1% |
| 474544677 | 1 | < 0.1% |
| 460935665 | 1 | < 0.1% |
| 458991599 | 1 | < 0.1% |
| 448130642 | 1 | < 0.1% |
| 436471036 | 1 | < 0.1% |
num_voted_users
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 4593 |
|---|---|
| Distinct (%) | 97.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 87783.318 |
| Minimum | 5 |
|---|---|
| Maximum | 1689764 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 73.5 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 1099.4 |
| Q1 | 10774 |
| median | 37952 |
| Q3 | 101938 |
| 95-th percentile | 343205.1 |
| Maximum | 1689764 |
| Range | 1689759 |
| Interquartile range (IQR) | 91164 |
Descriptive statistics
| Standard deviation | 140733.28 |
|---|---|
| Coefficient of variation (CV) | 1.6031894 |
| Kurtosis | 23.651174 |
| Mean | 87783.318 |
| Median Absolute Deviation (MAD) | 32809 |
| Skewness | 3.9557772 |
| Sum | 4.1284494 × 10⁸ |
| Variance | 1.9805856 × 10¹⁰ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3119 | 3 | 0.1% |
| 2541 | 3 | 0.1% |
| 3665 | 3 | 0.1% |
| 9903 | 2 | < 0.1% |
| 6069 | 2 | < 0.1% |
| 80639 | 2 | < 0.1% |
| 25870 | 2 | < 0.1% |
| 1231 | 2 | < 0.1% |
| 3943 | 2 | < 0.1% |
| 53 | 2 | < 0.1% |
| Other values (4583) | 4680 | 99.5% |
| Value | Count | Frequency (%) |
| 5 | 2 | < 0.1% |
| 19 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 47 | 1 | < 0.1% |
| 48 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 53 | 2 | < 0.1% |
| 59 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1689764 | 1 | < 0.1% |
| 1676169 | 1 | < 0.1% |
| 1468200 | 1 | < 0.1% |
| 1347461 | 1 | < 0.1% |
| 1324680 | 1 | < 0.1% |
| 1251222 | 1 | < 0.1% |
| 1238746 | 1 | < 0.1% |
| 1217752 | 1 | < 0.1% |
| 1215718 | 1 | < 0.1% |
| 1155770 | 1 | < 0.1% |
facenumber_in_poster
Real number (ℝ)
ZEROS 
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.3567935 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 2019 |
| Zeros (%) | 42.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 73.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 2.0086637 |
|---|---|
| Coefficient of variation (CV) | 1.4804491 |
| Kurtosis | 55.770377 |
| Mean | 1.3567935 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.5646736 |
| Sum | 6381 |
| Variance | 4.03473 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2019 | 42.9% |
| 1 | 1179 | 25.1% |
| 2 | 665 | 14.1% |
| 3 | 359 | 7.6% |
| 4 | 190 | 4.0% |
| 5 | 100 | 2.1% |
| 6 | 67 | 1.4% |
| 7 | 45 | 1.0% |
| 8 | 34 | 0.7% |
| 9 | 15 | 0.3% |
| Other values (9) | 30 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 2019 | 42.9% |
| 1 | 1179 | 25.1% |
| 2 | 665 | 14.1% |
| 3 | 359 | 7.6% |
| 4 | 190 | 4.0% |
| 5 | 100 | 2.1% |
| 6 | 67 | 1.4% |
| 7 | 45 | 1.0% |
| 8 | 34 | 0.7% |
| 9 | 15 | 0.3% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 15 | 5 | 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| 12 | 4 | 0.1% |
| 11 | 5 | 0.1% |
| 10 | 10 | 0.2% |
| 9 | 15 | 0.3% |
budget
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 432 |
|---|---|
| Distinct (%) | 9.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39306827 |
| Minimum | 218 |
|---|---|
| Maximum | 1.22155 × 10¹⁰ |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 73.5 KiB |
Quantile statistics
| Minimum | 218 |
|---|---|
| 5-th percentile | 800000 |
| Q1 | 7500000 |
| median | 20000000 |
| Q3 | 40000000 |
| 95-th percentile | 1.25 × 10⁸ |
| Maximum | 1.22155 × 10¹⁰ |
| Range | 1.22155 × 10¹⁰ |
| Interquartile range (IQR) | 32500000 |
Descriptive statistics
| Standard deviation | 2.02669 × 10⁸ |
|---|---|
| Coefficient of variation (CV) | 5.1560762 |
| Kurtosis | 2820.5211 |
| Mean | 39306827 |
| Median Absolute Deviation (MAD) | 15000000 |
| Skewness | 49.023957 |
| Sum | 1.8486001 × 10¹¹ |
| Variance | 4.1074723 × 10¹⁶ |
| Monotonicity | Not monotonic |
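The skewness of 49.02 reported for budget reflects a few extreme values pulling the right tail far from the bulk of the data. The sketch below shows one common estimator, the adjusted Fisher-Pearson coefficient. Whether the profiler uses exactly this estimator is an assumption, `sample_skewness` is a hypothetical helper, and the toy values are invented for illustration rather than taken from the dataset.

```python
import math

def sample_skewness(values):
    """Adjusted Fisher-Pearson skewness (bias-corrected for sample size)."""
    n = len(values)
    if n < 3:
        raise ValueError("need at least three values")
    mean = sum(values) / n
    m2 = sum((x - mean) ** 2 for x in values) / n  # second central moment
    m3 = sum((x - mean) ** 3 for x in values) / n  # third central moment
    if m2 == 0:
        return 0.0  # constant data has no asymmetry
    g1 = m3 / m2 ** 1.5
    return math.sqrt(n * (n - 1)) / (n - 2) * g1

# Hypothetical budgets (in millions) with one extreme outlier.
toy_budgets = [5, 10, 15, 20, 20, 25, 30, 40, 60, 12_000]
symmetric = [1, 2, 3, 4, 5]

print(sample_skewness(symmetric))
print(sample_skewness(toy_budgets) > 2)
```

A single outlier is enough to produce strong positive skew in the toy sample, which mirrors how the 1.22155 × 10¹⁰ maximum dominates the budget statistics.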
| Value | Count | Frequency (%) |
| 20000000 | 442 | 9.4% |
| 30000000 | 145 | 3.1% |
| 15000000 | 141 | 3.0% |
| 25000000 | 139 | 3.0% |
| 10000000 | 137 | 2.9% |
| 40000000 | 131 | 2.8% |
| 35000000 | 120 | 2.6% |
| 50000000 | 104 | 2.2% |
| 5000000 | 102 | 2.2% |
| 60000000 | 94 | 2.0% |
| Other values (422) | 3148 | 66.9% |
| Value | Count | Frequency (%) |
| 218 | 1 | < 0.1% |
| 1100 | 1 | < 0.1% |
| 4500 | 1 | < 0.1% |
| 7000 | 3 | 0.1% |
| 9000 | 1 | < 0.1% |
| 10000 | 2 | < 0.1% |
| 14000 | 1 | < 0.1% |
| 15000 | 2 | < 0.1% |
| 20000 | 3 | 0.1% |
| 22000 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.22155 × 10¹⁰ | 1 | < 0.1% |
| 4200000000 | 1 | < 0.1% |
| 2500000000 | 1 | < 0.1% |
| 2400000000 | 1 | < 0.1% |
| 2127519898 | 1 | < 0.1% |
| 1100000000 | 1 | < 0.1% |
| 1000000000 | 1 | < 0.1% |
| 700000000 | 2 | < 0.1% |
| 600000000 | 1 | < 0.1% |
| 553632000 | 1 | < 0.1% |
title_year
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 91 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2002.1112 |
| Minimum | 1916 |
|---|---|
| Maximum | 2016 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 73.5 KiB |
Quantile statistics
| Minimum | 1916 |
|---|---|
| 5-th percentile | 1978 |
| Q1 | 1999 |
| median | 2005 |
| Q3 | 2010 |
| 95-th percentile | 2015 |
| Maximum | 2016 |
| Range | 100 |
| Interquartile range (IQR) | 11 |
Descriptive statistics
| Standard deviation | 12.50241 |
|---|---|
| Coefficient of variation (CV) | 0.0062446132 |
| Kurtosis | 7.3909201 |
| Mean | 2002.1112 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -2.2877603 |
| Sum | 9415929 |
| Variance | 156.31026 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2009 | 252 | 5.4% |
| 2006 | 235 | 5.0% |
| 2008 | 222 | 4.7% |
| 2010 | 221 | 4.7% |
| 2011 | 215 | 4.6% |
| 2005 | 215 | 4.6% |
| 2014 | 214 | 4.6% |
| 2013 | 213 | 4.5% |
| 2004 | 206 | 4.4% |
| 2012 | 203 | 4.3% |
| Other values (81) | 2507 | 53.3% |
| Value | Count | Frequency (%) |
| 1916 | 1 | < 0.1% |
| 1920 | 1 | < 0.1% |
| 1925 | 1 | < 0.1% |
| 1927 | 1 | < 0.1% |
| 1929 | 2 | < 0.1% |
| 1930 | 1 | < 0.1% |
| 1932 | 1 | < 0.1% |
| 1933 | 2 | < 0.1% |
| 1934 | 1 | < 0.1% |
| 1935 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 2016 | 82 | 1.7% |
| 2015 | 183 | 3.9% |
| 2014 | 214 | 4.6% |
| 2013 | 213 | 4.5% |
| 2012 | 203 | 4.3% |
| 2011 | 215 | 4.6% |
| 2010 | 221 | 4.7% |
| 2009 | 252 | 5.4% |
| 2008 | 222 | 4.7% |
| 2007 | 197 | 4.2% |
aspect_ratio
Real number (ℝ)
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1255305 |
| Minimum | 1.18 |
|---|---|
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 73.5 KiB |
Quantile statistics
| Minimum | 1.18 |
|---|---|
| 5-th percentile | 1.78 |
| Q1 | 1.85 |
| median | 2.35 |
| Q3 | 2.35 |
| 95-th percentile | 2.35 |
| Maximum | 16 |
| Range | 14.82 |
| Interquartile range (IQR) | 0.5 |
Descriptive statistics
| Standard deviation | 0.63838629 |
|---|---|
| Coefficient of variation (CV) | 0.3003421 |
| Kurtosis | 377.18399 |
| Mean | 2.1255305 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 17.406589 |
| Sum | 9996.37 |
| Variance | 0.40753706 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2.35 | 2499 | 53.1% |
| 1.85 | 1870 | 39.8% |
| 1.37 | 97 | 2.1% |
| 1.78 | 79 | 1.7% |
| 1.66 | 63 | 1.3% |
| 1.33 | 37 | 0.8% |
| 2.2 | 14 | 0.3% |
| 2.39 | 14 | 0.3% |
| 16 | 8 | 0.2% |
| 2 | 4 | 0.1% |
| Other values (10) | 18 | 0.4% |
| Value | Count | Frequency (%) |
| 1.18 | 1 | < 0.1% |
| 1.2 | 1 | < 0.1% |
| 1.33 | 37 | 0.8% |
| 1.37 | 97 | 2.1% |
| 1.44 | 1 | < 0.1% |
| 1.5 | 2 | < 0.1% |
| 1.66 | 63 | 1.3% |
| 1.75 | 3 | 0.1% |
| 1.77 | 1 | < 0.1% |
| 1.78 | 79 | 1.7% |
| Value | Count | Frequency (%) |
| 16 | 8 | 0.2% |
| 2.76 | 3 | 0.1% |
| 2.55 | 2 | < 0.1% |
| 2.4 | 3 | 0.1% |
| 2.39 | 14 | 0.3% |
| 2.35 | 2499 | 53.1% |
| 2.24 | 1 | < 0.1% |
| 2.2 | 14 | 0.3% |
| 2 | 4 | 0.1% |
| 1.85 | 1870 | 39.8% |
movie_fb_likes
Real number (ℝ)
ZEROS 
| Distinct | 836 |
|---|---|
| Distinct (%) | 17.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7779.7997 |
| Minimum | 0 |
|---|---|
| Maximum | 349000 |
| Zeros | 2086 |
| Zeros (%) | 44.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 73.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 181 |
| Q3 | 5000 |
| 95-th percentile | 41900 |
| Maximum | 349000 |
| Range | 349000 |
| Interquartile range (IQR) | 5000 |
Descriptive statistics
| Standard deviation | 19611.482 |
|---|---|
| Coefficient of variation (CV) | 2.520821 |
| Kurtosis | 40.309513 |
| Mean | 7779.7997 |
| Median Absolute Deviation (MAD) | 181 |
| Skewness | 4.9742692 |
| Sum | 36588398 |
| Variance | 3.8461023 × 10⁸ |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2086 | 44.4% |
| 1000 | 103 | 2.2% |
| 11000 | 80 | 1.7% |
| 10000 | 79 | 1.7% |
| 12000 | 59 | 1.3% |
| 13000 | 58 | 1.2% |
| 2000 | 54 | 1.1% |
| 15000 | 51 | 1.1% |
| 14000 | 46 | 1.0% |
| 16000 | 46 | 1.0% |
| Other values (826) | 2041 | 43.4% |
| Value | Count | Frequency (%) |
| 0 | 2086 | 44.4% |
| 4 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| 7 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| 11 | 1 | < 0.1% |
| 12 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 16 | 1 | < 0.1% |
| 17 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 349000 | 1 | < 0.1% |
| 199000 | 1 | < 0.1% |
| 197000 | 1 | < 0.1% |
| 191000 | 1 | < 0.1% |
| 190000 | 1 | < 0.1% |
| 175000 | 1 | < 0.1% |
| 165000 | 1 | < 0.1% |
| 164000 | 1 | < 0.1% |
| 153000 | 1 | < 0.1% |
| 150000 | 1 | < 0.1% |
country_UK
Boolean
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.3 KiB |
| False | 4283 |
|---|---|
| True | 420 |
| Value | Count | Frequency (%) |
| False | 4283 | 91.1% |
| True | 420 | 8.9% |
country_USA
Boolean
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 41.3 KiB |
| True | 3575 |
|---|---|
| False | 1128 |
| Value | Count | Frequency (%) |
| True | 3575 | 76.0% |
| False | 1128 | 24.0% |
other_actors_fb_likes
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 2041 |
|---|---|
| Distinct (%) | 43.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2380.9303 |
| Minimum | 0 |
|---|---|
| Maximum | 137748 |
| Zeros | 32 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 73.5 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 50 |
| Q1 | 480.5 |
| median | 1017 |
| Q3 | 1581 |
| 95-th percentile | 13000 |
| Maximum | 137748 |
| Range | 137748 |
| Interquartile range (IQR) | 1100.5 |
Descriptive statistics
| Standard deviation | 5262.0495 |
|---|---|
| Coefficient of variation (CV) | 2.2100813 |
| Kurtosis | 107.20966 |
| Mean | 2380.9303 |
| Median Absolute Deviation (MAD) | 547 |
| Skewness | 6.8977295 |
| Sum | 11197515 |
| Variance | 27689165 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 32 | 0.7% |
| 2000 | 30 | 0.6% |
| 3000 | 19 | 0.4% |
| 12000 | 16 | 0.3% |
| 15000 | 16 | 0.3% |
| 13000 | 15 | 0.3% |
| 4000 | 15 | 0.3% |
| 14000 | 14 | 0.3% |
| 22000 | 13 | 0.3% |
| 24000 | 12 | 0.3% |
| Other values (2031) | 4521 | 96.1% |
| Value | Count | Frequency (%) |
| 0 | 32 | 0.7% |
| 2 | 10 | 0.2% |
| 3 | 3 | 0.1% |
| 4 | 5 | 0.1% |
| 5 | 7 | 0.1% |
| 6 | 2 | < 0.1% |
| 7 | 4 | 0.1% |
| 8 | 8 | 0.2% |
| 9 | 6 | 0.1% |
| 10 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 137748 | 1 | < 0.1% |
| 50000 | 1 | < 0.1% |
| 46000 | 1 | < 0.1% |
| 42000 | 1 | < 0.1% |
| 40000 | 2 | < 0.1% |
| 39000 | 1 | < 0.1% |
| 38000 | 1 | < 0.1% |
| 37000 | 2 | < 0.1% |
| 36000 | 8 | 0.2% |
| 35000 | 1 | < 0.1% |
critic_reviews_ratio
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 3844 |
|---|---|
| Distinct (%) | 81.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8893224 |
| Minimum | 0.037037037 |
|---|---|
| Maximum | 25 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 73.5 KiB |
Quantile statistics
| Minimum | 0.037037037 |
|---|---|
| 5-th percentile | 0.2016469 |
| Q1 | 0.38334609 |
| median | 0.62222222 |
| Q3 | 1.0902912 |
| 95-th percentile | 2.2437546 |
| Maximum | 25 |
| Range | 24.962963 |
| Interquartile range (IQR) | 0.7069451 |
Descriptive statistics
| Standard deviation | 1.0070265 |
|---|---|
| Coefficient of variation (CV) | 1.1323525 |
| Kurtosis | 161.81569 |
| Mean | 0.8893224 |
| Median Absolute Deviation (MAD) | 0.29255189 |
| Skewness | 9.1138071 |
| Sum | 4182.4832 |
| Variance | 1.0141023 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 43 | 0.9% |
| 0.5 | 29 | 0.6% |
| 0.3333333333 | 19 | 0.4% |
| 2 | 19 | 0.4% |
| 0.6666666667 | 16 | 0.3% |
| 1.5 | 13 | 0.3% |
| 0.8 | 12 | 0.3% |
| 0.4 | 11 | 0.2% |
| 0.5714285714 | 10 | 0.2% |
| 3 | 8 | 0.2% |
| Other values (3834) | 4523 | 96.2% |
| Value | Count | Frequency (%) |
| 0.03703703704 | 1 | < 0.1% |
| 0.04802123552 | 1 | < 0.1% |
| 0.05263157895 | 1 | < 0.1% |
| 0.05869565217 | 1 | < 0.1% |
| 0.0625 | 1 | < 0.1% |
| 0.06482504604 | 1 | < 0.1% |
| 0.07474352711 | 1 | < 0.1% |
| 0.07575757576 | 1 | < 0.1% |
| 0.07692307692 | 1 | < 0.1% |
| 0.07851239669 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 25 | 1 | < 0.1% |
| 21 | 2 | < 0.1% |
| 18 | 1 | < 0.1% |
| 9.428571429 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 8.75 | 1 | < 0.1% |
| 8.5 | 1 | < 0.1% |
| 8 | 2 | < 0.1% |
| 7 | 1 | < 0.1% |
| 6.759259259 | 1 | < 0.1% |
imdb_classification
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 73.5 KiB |
| 2 | 3018 |
|---|---|
| 1 | 1327 |
| 3 | 204 |
| 0 | 154 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 4703 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 2 |
| 3rd row | 2 |
| 4th row | 3 |
| 5th row | 2 |
Common Values
| Value | Count | Frequency (%) |
| 2 | 3018 | 64.2% |
| 1 | 1327 | 28.2% |
| 3 | 204 | 4.3% |
| 0 | 154 | 3.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 3018 | 64.2% |
| 1 | 1327 | 28.2% |
| 3 | 204 | 4.3% |
| 0 | 154 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 4703 | 100.0% |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3018 | 64.2% |
| 1 | 1327 | 28.2% |
| 3 | 204 | 4.3% |
| 0 | 154 | 3.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 4703 | 100.0% |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3018 | 64.2% |
| 1 | 1327 | 28.2% |
| 3 | 204 | 4.3% |
| 0 | 154 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 4703 | 100.0% |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 3018 | 64.2% |
| 1 | 1327 | 28.2% |
| 3 | 204 | 4.3% |
| 0 | 154 | 3.3% |
Correlations
| | actor_1_fb_likes | aspect_ratio | budget | country_UK | country_USA | critic_reviews_ratio | director_fb_likes | duration | facenumber_in_poster | gross | imdb_classification | movie_fb_likes | num_voted_users | other_actors_fb_likes | title_year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| actor_1_fb_likes | 1.000 | 0.145 | 0.389 | 0.000 | 0.015 | -0.104 | 0.143 | 0.212 | 0.116 | 0.317 | 0.029 | 0.112 | 0.432 | 0.737 | 0.119 |
| aspect_ratio | 0.145 | 1.000 | 0.266 | 0.000 | 0.000 | 0.082 | 0.057 | 0.216 | 0.035 | 0.094 | 0.000 | 0.076 | 0.123 | 0.115 | 0.290 |
| budget | 0.389 | 0.266 | 1.000 | 0.000 | 0.050 | -0.138 | 0.173 | 0.336 | 0.026 | 0.579 | 0.000 | 0.097 | 0.501 | 0.384 | 0.143 |
| country_UK | 0.000 | 0.000 | 0.000 | 1.000 | 0.556 | 0.025 | -0.012 | 0.054 | 0.003 | -0.094 | 0.108 | 0.000 | 0.007 | -0.067 | -0.017 |
| country_USA | 0.015 | 0.000 | 0.050 | 0.556 | 1.000 | -0.125 | 0.050 | -0.048 | 0.033 | 0.269 | 0.106 | 0.024 | 0.116 | 0.266 | -0.051 |
| critic_reviews_ratio | -0.104 | 0.082 | -0.138 | 0.025 | -0.125 | 1.000 | -0.064 | -0.234 | 0.055 | -0.293 | 0.004 | 0.106 | -0.325 | -0.125 | 0.580 |
| director_fb_likes | 0.143 | 0.057 | 0.173 | -0.012 | 0.050 | -0.064 | 1.000 | 0.199 | 0.008 | 0.161 | 0.139 | 0.043 | 0.256 | 0.117 | -0.019 |
| duration | 0.212 | 0.216 | 0.336 | 0.054 | -0.048 | -0.234 | 0.199 | 1.000 | 0.049 | 0.244 | 0.209 | 0.107 | 0.358 | 0.169 | -0.075 |
| facenumber_in_poster | 0.116 | 0.035 | 0.026 | 0.003 | 0.033 | 0.055 | 0.008 | 0.049 | 1.000 | -0.022 | 0.007 | -0.012 | -0.041 | 0.113 | 0.064 |
| gross | 0.317 | 0.094 | 0.579 | -0.094 | 0.269 | -0.293 | 0.161 | 0.244 | -0.022 | 1.000 | 0.122 | 0.104 | 0.627 | 0.355 | 0.022 |
| imdb_classification | 0.029 | 0.000 | 0.000 | 0.108 | 0.106 | 0.004 | 0.139 | 0.209 | 0.007 | 0.122 | 1.000 | 0.108 | 0.373 | -0.008 | -0.127 |
| movie_fb_likes | 0.112 | 0.076 | 0.097 | 0.000 | 0.024 | 0.106 | 0.043 | 0.107 | -0.012 | 0.104 | 0.108 | 1.000 | 0.215 | 0.098 | 0.273 |
| num_voted_users | 0.432 | 0.123 | 0.501 | 0.007 | 0.116 | -0.325 | 0.256 | 0.358 | -0.041 | 0.627 | 0.373 | 0.215 | 1.000 | 0.378 | 0.029 |
| other_actors_fb_likes | 0.737 | 0.115 | 0.384 | -0.067 | 0.266 | -0.125 | 0.117 | 0.169 | 0.113 | 0.355 | -0.008 | 0.098 | 0.378 | 1.000 | 0.105 |
| title_year | 0.119 | 0.290 | 0.143 | -0.017 | -0.051 | 0.580 | -0.019 | -0.075 | 0.064 | 0.022 | -0.127 | 0.273 | 0.029 | 0.105 | 1.000 |
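The high-correlation alerts at the top of the report are consistent with flagging every pair whose coefficient in this matrix has an absolute value of at least 0.5. The sketch below applies that filter to a handful of coefficients copied from the matrix. The 0.5 cutoff is inferred from this report rather than taken from the profiler's configuration, and `high_pairs` is a hypothetical helper.

```python
# A handful of coefficients copied from the correlation matrix above.
corr = {
    ("actor_1_fb_likes", "other_actors_fb_likes"): 0.737,
    ("budget", "gross"): 0.579,
    ("budget", "num_voted_users"): 0.501,
    ("gross", "num_voted_users"): 0.627,
    ("country_UK", "country_USA"): 0.556,
    ("critic_reviews_ratio", "title_year"): 0.580,
    ("duration", "num_voted_users"): 0.358,
    ("imdb_classification", "num_voted_users"): 0.373,
}

THRESHOLD = 0.5  # assumed cutoff, inferred from the alerts in this report

def high_pairs(corr, threshold=THRESHOLD):
    """Return variable pairs with |r| >= threshold, strongest first."""
    flagged = [pair for pair, r in corr.items() if abs(r) >= threshold]
    return sorted(flagged, key=lambda pair: -abs(corr[pair]))

flagged = high_pairs(corr)
for left, right in flagged:
    print(f"{left} ~ {right}: {corr[(left, right)]:.3f}")
```

Pairs such as duration and num_voted_users (0.358) fall below the cutoff, which matches their absence from the alert list.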
First rows
| | duration | director_fb_likes | actor_1_fb_likes | gross | num_voted_users | facenumber_in_poster | budget | title_year | aspect_ratio | movie_fb_likes | country_UK | country_USA | other_actors_fb_likes | critic_reviews_ratio | imdb_classification |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 178.0 | 0.0 | 1000.0 | 760505847.0 | 886204 | 0.0 | 237000000.0 | 2009.0 | 1.78 | 33000 | False | True | 1791.0 | 0.236739 | 2 |
| 1 | 169.0 | 563.0 | 40000.0 | 309404152.0 | 471220 | 0.0 | 300000000.0 | 2007.0 | 2.35 | 0 | False | True | 6000.0 | 0.243942 | 2 |
| 2 | 148.0 | 0.0 | 11000.0 | 200074175.0 | 275868 | 1.0 | 245000000.0 | 2015.0 | 2.35 | 85000 | True | False | 554.0 | 0.605634 | 2 |
| 3 | 164.0 | 22000.0 | 27000.0 | 448130642.0 | 1144337 | 0.0 | 250000000.0 | 2012.0 | 2.35 | 164000 | False | True | 46000.0 | 0.301000 | 3 |
| 5 | 132.0 | 475.0 | 640.0 | 73058679.0 | 212204 | 1.0 | 263700000.0 | 2012.0 | 2.35 | 24000 | False | True | 1162.0 | 0.626016 | 2 |
| 6 | 156.0 | 0.0 | 24000.0 | 336530303.0 | 383056 | 0.0 | 258000000.0 | 2007.0 | 2.35 | 0 | False | True | 15000.0 | 0.206099 | 2 |
| 7 | 100.0 | 15.0 | 799.0 | 200807262.0 | 294810 | 1.0 | 260000000.0 | 2010.0 | 1.85 | 29000 | False | True | 837.0 | 0.837209 | 2 |
| 8 | 141.0 | 0.0 | 26000.0 | 458991599.0 | 462669 | 4.0 | 250000000.0 | 2015.0 | 2.35 | 118000 | False | True | 40000.0 | 0.568487 | 2 |
| 9 | 153.0 | 282.0 | 25000.0 | 301956980.0 | 321795 | 3.0 | 250000000.0 | 2009.0 | 2.35 | 10000 | True | False | 21000.0 | 0.385406 | 2 |
| 10 | 183.0 | 0.0 | 15000.0 | 330249062.0 | 371639 | 0.0 | 250000000.0 | 2016.0 | 2.35 | 197000 | False | True | 6000.0 | 0.222995 | 2 |
Last rows
| | duration | director_fb_likes | actor_1_fb_likes | gross | num_voted_users | facenumber_in_poster | budget | title_year | aspect_ratio | movie_fb_likes | country_UK | country_USA | other_actors_fb_likes | critic_reviews_ratio | imdb_classification |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 5026 | 110.0 | 107.0 | 576.0 | 136007.0 | 3924 | 1.0 | 4500.0 | 2004.0 | 2.35 | 171 | False | False | 178.0 | 2.076923 | 2 |
| 5027 | 90.0 | 397.0 | 5.0 | 673780.0 | 4555 | 0.0 | 10000.0 | 2000.0 | 1.85 | 697 | False | False | 0.0 | 2.461538 | 2 |
| 5029 | 111.0 | 62.0 | 89.0 | 94596.0 | 6318 | 0.0 | 1000000.0 | 1997.0 | 1.85 | 817 | False | False | 19.0 | 1.560000 | 2 |
| 5032 | 98.0 | 3.0 | 789.0 | 24848292.0 | 438 | 1.0 | 20000000.0 | 1995.0 | 2.35 | 20 | False | True | 346.0 | 0.714286 | 2 |
| 5033 | 77.0 | 291.0 | 291.0 | 424760.0 | 72639 | 0.0 | 7000.0 | 2004.0 | 1.85 | 19000 | False | True | 53.0 | 0.385445 | 2 |
| 5034 | 80.0 | 0.0 | 0.0 | 70071.0 | 589 | 0.0 | 7000.0 | 2005.0 | 2.35 | 74 | False | False | 0.0 | 1.000000 | 2 |
| 5035 | 81.0 | 0.0 | 121.0 | 2040920.0 | 52055 | 0.0 | 7000.0 | 1992.0 | 1.37 | 0 | False | True | 26.0 | 0.430769 | 2 |
| 5037 | 95.0 | 0.0 | 296.0 | 4584.0 | 1338 | 1.0 | 9000.0 | 2011.0 | 2.35 | 413 | False | True | 338.0 | 1.000000 | 2 |
| 5038 | 87.0 | 2.0 | 637.0 | 24848292.0 | 629 | 2.0 | 20000000.0 | 2013.0 | 2.35 | 84 | False | False | 788.0 | 0.166667 | 2 |
| 5042 | 90.0 | 16.0 | 86.0 | 85222.0 | 4285 | 0.0 | 1100.0 | 2004.0 | 1.85 | 456 | False | True | 39.0 | 0.511905 | 2 |
Most frequently occurring
| | duration | director_fb_likes | actor_1_fb_likes | gross | num_voted_users | facenumber_in_poster | budget | title_year | aspect_ratio | movie_fb_likes | country_UK | country_USA | other_actors_fb_likes | critic_reviews_ratio | imdb_classification | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 101.0 | 3.0 | 448.0 | 12189514.0 | 30092 | 0.0 | 23000000.0 | 2004.0 | 2.35 | 0 | False | True | 263.0 | 0.503876 | 2 | 2 |
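The table above shows one row pattern that occurs twice. A minimal sketch of how such repeats can be found by counting identical rows follows. The three-column rows are shortened, hypothetical stand-ins for the full 15-column records, and `duplicate_summary` is an illustrative helper, not the profiler's own routine.

```python
from collections import Counter

def duplicate_summary(rows):
    """Return (surplus copies, {row: occurrences}) for rows seen more than once."""
    counts = Counter(rows)
    repeated = {row: n for row, n in counts.items() if n > 1}
    # Surplus copies are the rows that would be dropped by de-duplication.
    surplus = sum(n - 1 for n in repeated.values())
    return surplus, repeated

# Hypothetical shortened rows: (duration, director_fb_likes, actor_1_fb_likes).
rows = [
    (101.0, 3.0, 448.0),
    (90.0, 16.0, 86.0),
    (101.0, 3.0, 448.0),
]
surplus, repeated = duplicate_summary(rows)
print(surplus, repeated)
```

Rows must be hashable (tuples here) for `Counter` to group them. Dropping the surplus copies would leave each distinct row exactly once.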